Discriminating Same or Different speech: human vs machine

نویسندگان

  • Greg Kochanski
  • Christina Orphanidou
  • Burton S. Rosner
چکیده

We compare the performance of a Bayesian classifier against humans in a same/different speech perception task. The classifier is trained on different sizes of speech segments to separate sounds into perceptually equivalent and nonequivalent categories. We test the classifier with pairs of speech sounds of various sizes and compare its performance with perceptual listening tests on native speakers of Southern British English as well as Greek native speakers who speak English as a second language. We find that the performance of the classifier is comparable to that of human subjects and that the largest-sized speech segments produce the best performance overall. We also find that an edge detector in the 60-800 Hz band and an autocorrelation-based measure of voicing are very important to the performance of the classifier.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

P65: Speech Recognition Based on Bbrain Signals by the Quantum Support Vector Machine for Inflammatory Patient ALS

People communicate with each other by exchanging verbal and visual expressions. However, paralyzed patients with various neurological diseases such as amyotrophic lateral sclerosis and cerebral ischemia have difficulties in daily communications because they cannot control their body voluntarily. In this context, brain-computer interface (BCI) has been studied as a tool of communication for thes...

متن کامل

Discriminating speech and non-speech with regularized least squares

We consider the task of discriminating speech and non-speech in noisy environments. Previously, Mesgarani et. al [1] achieved state-of-the-art performance using a cortical representation of sound in conjunction with a feature reduction algorithm and a nonlinear support vector machine classifier. In the present work, we show that we can achieve the same or better accuracy by using a linear regul...

متن کامل

Emotional State Categorization from Speech: Machine vs. Human

This paper presents our investigations on emotional state categorization from speech signals with a psychologically inspired computational model against human performance under the same experimental setup. Based on psychological studies, we propose a multistage categorization strategy which allows establishing an automatic categorization model flexibly for a given emotional speech categorizatio...

متن کامل

Annotate and Identify Modalities, Speech Acts and Finer-Grained Event Types in Chinese Text

Discriminating sentences that denote modalities and speech acts from the ones that describe or report events is a fundamental task for accurate event processing. However, little attention has been paid on this issue. No Chinese corpus is available by now with all different types of sentences annotated with their main functionalities in terms of modality, speech act or event. This paper describe...

متن کامل

Linguistic and Acoustic Changes of U Different Dialogue S

This paper presents the characteristic differences of acoustic and linguistic features observed in different spoken dialogue situations: human-human vs. human-machine interactions. We compare the acoustic and linguistic features of the user’s speech to a spoken dialogue system and to a human-operator in several landmark setting tasks for a car navigation system. It has been pointed out that spe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007